Combining Wavelets and Computational Intelligence Methods with Applications on Multi-class Classification Datasets
نویسنده
چکیده
In this paper, we propose a novel algorithm for wavelet feature extraction as input to a supervised Multi-Class Classifier to improve classification performance. In particular, to select the best wavelets coefficient features, we first compute the energy-based variance distribution from wavelets coefficients at different subbands as well as the entropy-based fuzzy measures associated with the training instances. Once we get these entropy-based fuzzy measures associated with the different subsets of wavelets subbands, we apply the Möbius Transform to these entropy-based fuzzy measures to extract the Multivariate Mutual Information associated with the different subsets of wavelets subbands. The goal of these measures is twofold: assign weights (based on the wavelets information content) to all subsets of wavelets subbands and extract the independent (in terms of the multivariate mutual information) subsets of wavelets subbands. In our case, the optimal subsets of wavelets subbands as wavelets features vectors to train a Bayesian Network Model are those which provide a multivariate mutual information equal to zero. Experimental results with the multi-class SRBCT cancer dataset, show that our proposed approach achieves lower classification error in comparison with other methods proposed in the literature. Keywords— fuzzy measures, mutual information, wavelets.
منابع مشابه
MLIFT: Enhancing Multi-label Classifier with Ensemble Feature Selection
Multi-label classification has gained significant attention during recent years, due to the increasing number of modern applications associated with multi-label data. Despite its short life, different approaches have been presented to solve the task of multi-label classification. LIFT is a multi-label classifier which utilizes a new strategy to multi-label learning by leveraging label-specific ...
متن کاملFace Recognition using an Affine Sparse Coding approach
Sparse coding is an unsupervised method which learns a set of over-complete bases to represent data such as image and video. Sparse coding has increasing attraction for image classification applications in recent years. But in the cases where we have some similar images from different classes, such as face recognition applications, different images may be classified into the same class, and hen...
متن کاملExploiting Associations between Class Labels in Multi-label Classification
Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...
متن کاملBreast Cancer Diagnosis from Perspective of Class Imbalance
Introduction: Breast cancer is the second cause of mortality among women. Early detection is the only rescue to reduce the risk of breast cancer mortality. Traditional methods cannot effectively diagnose tumor since they are based on the assumption of well-balanced dataset.. However, a hybrid method can help to alleviate the two-class imbalance problem existing in the ...
متن کاملProtein Secondary Structure Prediction Using Support Vector Machines and a New Feature Representation
Knowledge of the secondary structure and solvent accessibility of a protein plays a vital role in the prediction of fold, and eventually the tertiary structure of the protein. A challenging issue of predicting protein secondary structure from sequence alone is addressed. Support vector machines (SVM) are employed for the classification and the SVM outputs are converted to posterior probabilitie...
متن کامل